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(57) ABSTRACT 

A network system includes a content provider connected to 
local service providers via an interactive distribution 
network, such as the Internet. The local service providers 
facilitate delivery of the content from the content provider to 
multiple subscribers. The local service providers schedule 
delivery of frequently requested content from the content 
provider prior to a peak lime when the subscribers are likely 
to request the content. The content is downloaded from the 
content provider during the off-peak hours and cached at the 
local service providers for serving to the subscribers during 
the ensuing peak time. In this manner, the frequently 
requested content is already present at the local service 
providers and ready to be served to the subscribers before 
they actually request it. When the content is finally 
requested, the data is streamed continuously in real-time for 
just-in-time rendering at the subscribers computer. Another 
aspect of this invention involves supplementing content 
delivery over the Internet with delivery of content over a 
secondary network, such as a broadcast satellite network. 
The supplemental broadcast link offers additional bandwidth 
at a fraction of the cost that would be incurred if the local 
service provider installed additional Internet connections, 
such as Tl or T3 connections. 

7 Claims, 6 Drawing Sheets 
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PULL BASED, INTELLIGENT CACHING 
SYSTEM AND METHOD 

This is a divisional of U.S. patent application Ser. No.08/ 
703,487, filed Aug. 26, 1996, which is now U.S. Pat. No. 5 
5,991,306. 

TECHNICAL FIELD 

This invention relates to network systems, and particu- 
larly public network systems, such as the Internet More jo 
particularly, this invention relates to methods which improve 
distribution of streaming continuous data (e.g., audio and 
video data) from a content provider over a network to a 
subscriber's computer or other content rendering unit. 

BACKGROUND OF THE INVENTION 15 

Public networks, and most notably the Internet, are 
emerging as a primary conduit for communications, 
entertainment, and business services. The Internet is a net- 
work formed by the cooperative interconnection of comput- 
ing networks, including local and wide area networks. It 
interconnects computers from around the world with exist- 
ing and even incompatible technologies by employing com- 
mon protocols &hat smoothly integrate the individual and 
diverse components. 25 

The Internet has recently been popularized by the over- 
whelming and rapid success of the World Wide Web (WWW 
or Web). The Web is a graphical user interface to the Internet 
that facilitates interaction between users and the Internet The 
Web links together various topics in a complex, non- 30 
sequential web of associations which permit a user to 
browse from one topic to another, regardless of the presented 
order of topics. A "Web browser" is an application which 
executes on the user's computer to navigate the Web. The 
Web browser allows a user to retrieve and render hyperme- 3S 
dia content from the WWW, including text, sound, images, 
video, and other data. 

One problem facing the continued growth and acceptance 
of the Internet concerns dissemination of streaming continu- 
ous data, such as video and audio content. Data is delivered 40 
and rendered to users in essentially two formats. The first 
format referred to as "block data " entails downloading the 
entire data set to local storage and then rendering the data 
from the locally stored copy. A second delivery format, 
known as "streaming data," entails sending bits of data 45 
continuously over the network for just-in-timc rendering. 

Computer network users have been conditioned through 
their experiences with television and CD-ROM multimedia 
applications to expect instantaneous streaming data on 
demand. For technical reasons, however, the Internet is often 50 
unable to deliver streaming data. This inability is most 
pronounced for video data. In the Internet context, there is 
often long delays between the time video content is 
requested and the time when the video content actually 
begins playing. It is not uncommon to wait several minutes 55 
for a video file to begin playing. In essence, for factors 
discussed below, video data is traditionally delivered as 
"block data" over the Internet and thus requires that the 
entire file be downloaded prior to rendering. 

The inability to provide streaming data is a result of too 60 
little bandwidth in the distribution network. "Bandwidth" is 
the amount of data that can be moved through a particular 
network segment at any one time. The Internet is a con- 
glomerate of different technologies with different associated 
bandwidths. Distribution over the Internet is usually con- 65 
strained by the segment with the lowest available band- 
width. 
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FIG. 1 shows a model of a public network system 20, such 
as the Internet. The network system 20 includes a content 
server 22 (e.g., a Web server) which stores and serves 
multimedia data over a distribution network 24. The network 
system 20 also has regional independent service providers 
(ISPs) or point of presence (POP) operators, as represented 
by ISP 26, which provide the connectivity to the primary 
distribution network 24. Many users, as represented by 
subscriber computers 28, 30, and 32, are connected to the 
ISP 26 to gain access to the Internet. 

The ISP 26 is connected to the distribution network 24 
with a network connection 34. In this example illustration, 
the network connection 34 is a "Tl" connection. "Tl" is a 
it of bandwidth having a base throughput speed of a approxi- 
mately 1 .5 Mbps (Megabits per second). Another common 
high bandwidth connection is a 13 connection, which has a 
base throughput speed of approximately 44,7 Mbps. For 
purposes of explaining the state of the technology and the 
practical problems with providing real-time streaming data 
over the Internet, it is sufficient to understand that there is 
also a limited bandwidth connection between the content 
server 22 and the distribution network 24. 

The subscriber computers 28, 30, and 32 are connected to 
their host ISP 26 via home entry lines, such as telephone or 
cable lines, and compatible modems. As examples of com- 
mercially available technology, subscriber computer 28 is 
connected to ISP 26 over a 14.4 K connection 36 which 
consists of a standard telephone line and a V. 32b is modem 
to enable a maximum data rate of 14.4 Kbps (Kilobits per 
second). Subscriber computer 30 is connected to the ISP 26 
with a 28.8 K connection 38 (telephone line and V.34 
modem) which supports a data rate of 28.8 Kbps. Subscriber 
computer 32 is connected to the ISP 26 with an ISDN 
connection 40 which is a special type of telephone line that 
facilitates data flow in the range of 128-132 Kbps. Table 1 
summarizes connection technologies that are available 
today. 

TABLE 1 



Connection Technologies and Throughput 



Connection Type Base Speed (Kbps) 



V.32bis modem 


14.4 


V.34 modem 


28.8 


S6K Leased Line 


56 


ISDN BRI (1 channel) 


56-64 


[SDN BRI (2 channels) 


128-132 


Frame Relay 


56-1,544 


Fractional Tl 


256-1,280 


ISDN PRI 


1,544 


Full Tl (24 channels) 


1,544 


ADSL 


2,000-6,000 


Cable Modem 


27,000 


T3 


44,736 



With a Tl connection to the primary distribution network 
24, the ISP 26 can facilitate a maximum data flow of 
approximately 1.5 Mbps. This bandwidth is available to 
serve all of the subscribers of the ISP. When subscriber 
computer 28 is connected and downloading data files, it 
requires a 14.4 Kbps slice of the 1.5 Mbps bandwidth. 
Subscriber computers 30 and 32 consume 28.8 Kbps and 
128 Kbps slices, respectively, of the available bandwidth 

The ISP can accommodate simultaneous requests from a 
number of subscribers. As more subscribers utilize the ISP 
services, however, there is less available bandwidth to 
satisfy the subscribers requests. If too many requests are 
received, the ISP becomes overburdened and may not be 
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able to adequately service the requests in a timely manner, 
causing frustration to the subscribers. If latency problems 
persist, the ISP can purchase more bandwidth by adding 
additional capacity (e.g., upgrading to a T3 connection or 
adding more Tl connections). Unfortunately, adding more 5 
bandwidth may;not be economically wise for the ISP. The 
load placed on the ISP typically fluctuates throughout dif- 
ferent times of the day Adding expensive bandwidth to more 
readily service short duration high-demand times may not be 
profitable if the present capacity adequately services the 10 
subscriber traffic during most of the day. 

The latency problems are perhaps the most pronounced 
when working with video. There are few things more 
frustrating to a user than trying to download video over the 
Internet. The problem is that video requires large bandwidth 15 
in comparison to text files, graphics, and pictures. 
Additionally, unlike still images or text files, video is pre- 
sented as moving images which are played continuously 
without interruption. Video typically requires a 1.2 Mbps for 
real-time streaming is data This 1.2 Mbps throughput 20 
requirement consumes nearly all of a Tl bandwidth (1.5 
Mbps). Accordingly, when multiple subscribers are coupled 
to the ISP and one subscriber requests a video file, there is 
generally not enough capacity to stream the video in real- 
time from the content server 22 over the Internet to the 25 
requesting subscriber. Instead, the video file is typically 
delivered in its entirety and only then played on the sub- 
scriber computer. Unfortunately, even downloading video 
files in the block data format is often inconvenient and 
usually requires an excessive amount of time. 30 

Consider the following example. Suppose a subscriber 
wishes to access the CNN Web site on the Internet for an 
account of recent news. As part of the news materials, CNN 
provides a twenty second video clip of an airplane hijacking 
incident. At 1.2 Mbps, the 20 second video clip involves 35 
downloading a 24 Mbyte file over the Internet. If the user has 
a modest 14.4 Kbps connection, it would take approximately 
28 minutes to download the entire file. 

Now, assume that the subscriber/ISP connection is sufiB- 4Q 
ciently large to handle real-time video streaming of the video 
file, meaning that the subscriber computer can render the 
video data as it is received from the ISP. Despite the 
bandwidth of the subscriber/ISP connection, real-time video 
streaming may still be unachievable if the Tl connection 34 45 
between the ISP 26 and the distribution network 24 is 
unable, or unwilling due to policy reasons, to dedicate 1.2 
Mbps of its bandwidth to the video file. Requests for the 
CNN video clip made during peak traffic times at the ISP 
most certainly could not be accommodated by the ISP/ 5Q 
network connection Since adding more bandwidth may be a 
poor investment for the ISP, the ISP may have no economic 
incentive to remedy the latency problem. The result is that 
some users might be inconvenienced by the lack of ability to 
receive streaming video despite their own connection to the 5S 
ISP being capable of accommodating streaming video. 

The latency problem is further aggravated if the connec- 
tion between the content server 22 and the distribution 
network 24 is equally taxed. The lack of sufficient band- 
width at the content server/network link could also prevent 60 
real-time video streaming over the Internet, regardless of the 
band widths of the network/ISP link or the ISP/subscriber 
link. If all links lack sufficient bandwidth, the latency 
problem can be compounded. 

One solution to this problem is to provide local cache 65 
storage at the ISP. As subscribers request files from the 
Internet, the ISP caches the files locally so that subsequent 
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requests are handled in a more expeditious manner. This 
process is known as "on-demand caching." Local 
on-demand caching methods improve the ability to deliver 
video content over the Internet When the first subscriber 
requests the CNN video clip of the airplane hijacking 
incident, the ISP requests the video clip from the CNN 
server, and facilitates delivery of the video clip to the 
requesting subscriber. The ISP also caches the video clip in 
its own memory. When any subsequent subscriber requests 
the same CNN video clip, the ISP serves the local version of 
the video clip from its own cache, rather than requesting the 
clip from the CNN server. If the subscriber computer has a 
high bandwidth connection with the ISP, the locally stored 
video clip can be served as continuous streaming video data 
for instantaneous rendering on the subscriber computer. 

A drawback of the on-demand caching method is that the 
first requesting subscriber is faced with the same latency 
problems described above. All subsequent subscribers have 
the benefit of the cached version. However, if the initial 
delay is too long, there may not be any subscriber who is 
willing to assume the responsibility of ordering the video file 
and then waiting for it to download 

Accordingly, there remains a need to develop improved 
techniques for facilitating distribution of streaming video 
over public networks, such as the Internet. 

SUMMARY OF THE INVENTION 

This invention provides improved methods for delivering 
large amounts of data, such as streaming audio and video 
data, over a network, such as the Internet. According to one 
aspect, the method involves an intelligent, preaching and 
pre-loading of frequently requested content to the local 
service provider (e.g., ISP or LAN network server) prior to 
peak demand times when the content is likely to be 
requested by the subscribers. In this manner, the frequently 
requested content is already downloaded and ready to be 
served to the subscribers before they actually request it. 
When the content is finally requested, the data is streamed 
continuously in real-time for just-in-time rendering at the 
subscriber. This eliminates the latency problems of prior art 
systems because the subscribers do not have to wait for the 
downloading of video and audio files over the Internet 
Moreover, intelligently pre -caching content before peak 
demand times is more effective than traditional on-demand 
caching because the content is available to the first sub- 
scriber who requests it. 

In one implementation, the network system includes a 
content provider connected to local service providers via a 
distribution network. The local service providers facilitate 
delivery of the content from the content provider to multiple 
subscribers. The local service providers are configured to 
request certain content from the content provider prior to a 
peak time when the subscribers are likely to request the 
content The content is downloaded from the content pro- 
vider during non-peak hours and cached at the local service 
providers for serving to the subscribers during the ensuing 
peak time. 

The local service provider includes a processing control 
unit, a cache memory, and a continuous media server. A hit 
recording module executes on the processing control unit to 
record requests for particular content from the subscribers. 
In the Internet context, these requests are submitted in the 
form of URLs (universal resource locators) for target 
resources located on the Web. A pattern recognizer detects 
behavior patterns based on subscriber requests to determine 
which content the subscribers are most likely to request and 
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when. Ascheduler then schedules requests for the frequently 
requested content from the content provider at a selected 
time prior to the peak demand time for that content. These 
requests arc posted to the content provider at their scheduled 
times, and the content provider downloads the content 
during the off-hours prior to the peak time. 

When the content is received from the content provider, 
the local service provider stores the content in the cache 
memory. For instance, the content might be a Web page from 
a frequently visited Web site. Web pages are typically 
designed as hypermedia documents to provide rich multi- 
media presentations which blend text, images, sound, and 
video. If the Web page references or includes continuous 
data files, such as audio or video files, these files are stored 
in a continuous media server. The target specifications 
embedded in the Web page to reference the continuous data 
files are modified to reference the local copy of the continu- 
ous data files, as opposed to the original location of the files 
at the Web site. 

During the ensuing peak time, the processing control unit 
serves the target resources maintained in the cache memory 
to the subscribers. If any subscriber clicks on or otherwise 
activates a link to an audio or video file, the requested file 
is served as a continuous stream of data from the continuous 
media server at the ISP. In this maimer, the continuous video 
or audio data stream can be rendered just-in-time by the 
subscriber. 

Another aspect of this invention involves supplementing 
the primary Internet connection owned by the ISP with a 
delivery of content over a secondary network. This supple- 
mental delivery effectively increases bandwidth between the 
content provider and the local service provider. In the 
described implementation, the content provider broadcasts 
additional content over a broadcast satellite network to the 
local service provider. The broadcast communication link 
offers additional bandwidth at a fraction of the cost that 
would be incurred if the local service provider installed 
additional Internet connections, such as Tl or T3 connec- 
tions. The broadcasted content is stored at the local service 
provider and served during peak times to afford continuous 
audio streaming to the subscribers. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a diagrammatic illustration of a network system 
which is used to explain the present state of Internet tech- 
nology. 

FIG. 2 is a diagrammatic illustration of a network system 
constructed according to one implementation of this inven- 
tion. 

FIG. 3 is a diagrammatic illustration of a network system 
constructed according to another implementation of this 
invention. 

FIG. 4 is a block diagram of the functional components in 
a local service provider in the network system. 

FIG. 5 is a flow diagram of a method for operating the 
local service provider. 

FIG. 6 is a diagrammatic illustration of a network system 
according to still another implementation of this invention. 

The same reference numbers are used throughout the 
figures to reference like components and features. 

DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENT 

FIG. 2 shows a public network system 50. It includes 
multiple content servers, as represented by content server 
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52, which store and serve content over a network 54. The 
content server 52 serves content in the form of text, audio, 
video, graphic images, and other multimedia data In the 
Internet context, the content servers might represent Web 

S sites which serve or multicast content in the form of hyper- 
media documents (e.g., Web page) which link text, images, 
sounds, and actions in a web of associations that permit a 
user to browse through related topics, regardless of the 
presented order of the topics. The content server 52 might 

1(J alternatively represent headend servers for a cable company 
which transmit video content over a cable network, or an 
audio server for a radio station that sends audio data over the 
network. The content server 52 might further represent 
servers for educational institutions, public agencies, 
libraries, merchants, or any other public or private organi- 

15 zations which serve or multicast information over the net- 
work. 

The network 54 is a high-speed, high-bandwidth interac- 
tive distribution network, and can be representative of the 
Internet Traffic over the network 54 is organized according 

20 to protocols which define how and when data is moved. One 
example protocol is the mission control protocol/Internet 
protocol (TCP/IP) which forms the backbone of the Internet. 
The network 54 might be implemented using various physi- 
cal mediums, including wirebased technologies (e.g., cable, 

25 telephone lines, etc.) and wireless technologies (e.g., 
satellite, cellular, infrared, etc.). The network is operated 
according to high-speed switching services, including 
connection-oriented network services (e.g., frame relay, 
asynchronous transfer mode (ATM), etc.) and connection- 

30 less services (e.g., switched multimegabit data service, etc.). 
These switching services support connection speeds of sev- 
eral Megabits per second (Mbps), up to Gigabits per second 
(Gbps). At these speeds, the network 54 is capable of 
supporting streaming video data which requires 1.2 Mbps. 

3j Many independent service providers (ISPs), as repre- 
sented by ISP 56, function as terminal connections or 
"on-ramps" to the high-speed network 54. The ISP 56 acts 
as an intermediary between the subscribers 58 and 60 and 
the network 54. The ISP 56 has a network port 62 which 

40 provides a high-speed, high-bandwidth connection 64 to the 
network 54. The ISPs segment and rent portions of the 
bandwidth to the multiple subscribers 58 and 60 so that the 
subscribers do not individually need to purchase and main- 
tain their own network connections. The ISPs 56 may also 

45 be referred to as point of presence (POP) servers, and the 
names "ISP" and "POP" are used interchangeably in this 
disclosure. 

The subscriber personal computers (PCs) 58 and 60 are 
individually connected to the ISP 56 by permanent or 

50 sessional dial-up connections. Conventional telephone or 
cable lines and compatible modems are used to form the 
connections 66, 68. Examples of suitable technologies 
include HFC, ISDN, POTS, and ADSL. The ISP 56 has 
network terminal switching equipment 70 to accommodate 

55 the connections to the subscriber PCs 58, 60. 

The ISP 56 also has a cache server 72 and a continuous 
media server (CMS) 74. The cache server 72 is configured 
as a conventional database server having processing 
capabilities, including a CPU (not shown), and storage 78. 

60 As one example, the cache server 72 is implemented as a 
SQL (Structure Query Language) database. The cache server 
72 caches Internet resources, such as those requested by 
subscriber computers 58, 60, that have been downloaded 
from the content provider 52 to allow localized serving of 

65 those resources. 

The CMS 74 is a server designed particularly for serving 
continuous data streams, such as video data and audio data, 
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in an ordered and uninterrupted manner. As one example provider acts as an intermediary facilitator which commu- 

implementation, the continuous media server is configured nicates the requests to the appropriate content server and 

as a disk array data storage system consisting of many large then returns the requested content to the appropriate sub- 

capacity storage disks with video and audio data streams scribers. 

stored digitally thereon. The locations of the video and audio 5 The local service provider 110 has a request handler 111 

data streams are kept in a memory map and each video and which manages requests received from the subscribers. In 

audio data stream is accessed through pointers to the par- the Web context, the subscriber computers run Web browser 

ticular memory location. To serve the audio or video data, applications which generate requests in the form of universal 

the processor grabs the pointer to the video stream and resource locators (URLs). A URL describes everything 

begins retrieving the video from the storage disk 82 and 1Q about a particular resource that a Web browser needs to 

streaming it over the communication line 66, 68 to the know to request and render it The URL describes the 

requesting subscriber computing unit protocol a browser should use to retrieve the resource, the 

FIG. 3 shows a network system 90 which is implemented name of the computer it is on, and the path and file name of 

in a local area network (LAN) configuration. This imple- lhe resource. The following is an example of a URL: 

mentation is exemplary of how a company or multi-user 15 ntt P : //www.microsoft.com/upgrades 

organization might be connected to the Internet. The net- ^ "bttp://" portion of the URL describes the protocol, 

work system 90 differs from the system of FIG, 2 in that the ^ letlers " htt P" stand for HyperText Transfer Protocol, the 

local service provider which facilitates the on-ramp connec- set of rules that a browser wiU follow to request a document 

tion to the high-speed, high-bandwidth network 54 is itself !? d ? c rem0 ( e sc ™ wll I fol l ow l ° S " PP & 1 1C d ™ mcnt 

a local server 92 on a LAN 94. The LAN 94 can be 20 "www.microsoft.com portion of the URL ls the name 

constructed using conventional network topologies, such as 2 ° £ th ? r f mo ' C h °t \ W ^ ch m " D ta ^/ he docum ' nt 

r<L t ™ T . ° T 4 , M , r . , t The last portion "/upgrades is the path and file name of the 

Ethernet The LAN network server 92 has a network port 96 doQum ^ Qn ^ T ^ host com £ utef 

which enables a high-speed, high-bandwidth connecUoD 98 when ^ request han(J]er m r£ceives a request> (he [ocai 

to the network 54. The cache server 72 and CMS 74 are service provider 110 first looks to its own cache memory 124 

connected to the LAN 94. Workstations or other computing 25 t0 determine tf a proxy copy of the target res01irce refer . 

units 100, 102 are connected to the LAN 94 and are served enced 5y the URL ^ stored ]ocally ^ cache memory 12 4 

by the LAN network server 92 in regards to Internet access. serves ^ a quas i-temporary local storage for holding proxy 

In this configuration, the LAN users of workstations 100, copies of often ^ and re q Ue sted target resources. The 

102 have access to the Internet through their enterprise LAN cache memory m can be iraplemented using different lypes 

94 and the LAN network server 92. 30 0 f memory, including RAM, storage disks (optical magnetic, 

It is noted that both implementations of FIGS. 2 and 3 are e tc), and the like. If a proxy copy is stored in the cache 

shown and described as suitable examples for implementing memory 124, the target resource is served locally from the 

various aspects of the invention. However, the network caca e memory 124. If there is no proxy copy, the local 

system might be implemented in a variety of arrangements. service provider 110 uses the URL request to locate the 

In addition, the illustrations show the subscriber units as 35 target resource from a content provider and to request 

being personal computers or work stations. However, the delivery of the target resource over the Internet. The local 

subscriber units can be implemented in other forms which service provider 110 passes the target resources on to the 

are capable of rendering content received over the network requesting subscriber and may also cache the target resource 

As examples, the subscriber computing units might include m me cache 124 if the policy rules governing the cache are 

televisions, computers, game devices, handheld devices, and 40 me t. 

the like. The j oca i service provider 110 has a hit recorder 112 

As explained in the Background section, conventional which is coupled to receive the URLs submitted by the 

techniques for delivering video and audio content over the subscribers. For each URL, the hit recorder 112 records hit 

Internet is plagued with latency problems. An aspect of this information in a URL hit database 114. The hit information 

invention is to provide an improved method for delivering 45 includes the date/time of the request, the subscriber who 

streaming audio and video content over a network system. made the request, and other information. The hit recorder 

The technique involves an intelligent, pre-caching and pre- 112 also triggers a pattern recognizer 116 which draws on 

loading of certain content at the local service provider (e.g., information in the URL hit database 114 to detect repetitive 

ISP, POP, LAN network server) prior to optimal or peak access behavior patterns based on subscriber requests. The 

demand times when the content is likely to be requested by 50 pattern recognizer 116 performs statistical analyses using hit 

the subscribers. In this manner, the frequently requested data from the URL hit database to determine usage patterns 

content is already downloaded and ready for access from the that help the local service provider be more responsive to the 

subscribers before they actually request it. When it is finally needs of its clientele. For instance, in the preferred 

requested, the data can be streamed continuously in real- implementation, the pattern recognizer 116 determines 

time for just-in-time rendering from the local service pro- 55 which URLs, and hence which Internet resources, are being 

vider to the subscriber. This eliminates the latency problems requested most often and least often, and the time of day 

of prior art systems. Moreover, intelligently pre-caching when the most requests are received. The pattern recognizer 

content before peak demand times is more effective than H6 is also responsive to operator input to allow adjustment 

traditional on-demand caching because the content is avail- or tuning by the operator for specialized analysis, 

able to the first subscriber who requests it. 6 o A scheduler 118 uses the pattern results generated by the 

FIG. 4 shows a functional block diagram of a local service pattern recognizer 116 to schedule requests for specific 

provider 110 according to one implementation which URLs of target resources on the Internet. The requests are 

enables intelligent pre-caching and pre-loading. At its most scheduled to be filled at pre-selected times prior to the peak 

fundamental level, the local service provider 110 provides times when the highest number of users are most likely to 

an on-ramp connection to the Internet for its subscribers. 65 request the content found at the URLs. Administrative tools 

The subscribers send requests to the local service provider 120 permit the operator to configure various operating 

110 for content available on the Internet. The local service parameters. 
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The pattern recognizer 116 and scheduler 118 cooperate to peak time (step 162 in FIG. 5). When the first subscriber 

enable intelligent pre-caching of frequently requested con- requests the CNN Web page at, for example, 6:40 AM, the 

tent The operation of the local service provider 110 to local service provider 110 serves the Web page from the 

perform this intelligent pre-caching according to an aspect cached memory 124. If the subscriber activates a link to a 

of this invention is described in conjunction with reference s video or audio file, the local service provider 110 immedi- 

to the flow diagram of FIG. 5. The local service provider is atcly ^wes the data stream from the CMS 126 for just-in- 

programmed to perform the computer-implemented steps of , ime ren dering on the subscriber's computer. Accordingly, 

FIG. 5 to alleviate the problems of providing streaming thc vidco fik ^ xpied as strcami daU t0 C vcn the first 

video and audio data over the Internet The steps are pre- subscriber who requests itj rather than makmg that sub . 

sented in the illustrated order for discussion purposes, but scfiber ^ fof { ^ fi , c tQ bc rctricvcd oycr thc & Intemel . 

are not rest ncted to this sequence. ~ . t ... . . , . . . . 

The pattern recognizer U6 monitors the patterns of the ™ e Poaching method obviates the latency 

subscriber requests to determine which content is most V ob } tms associated with streaming video and audio over 

frequently requested and when (step 150 in FIG. 5). From »*» lnternet > aad 15 a fortb « improvement to traditional 

these patterns, the pattern recognizer 116 can identify peak on-demand caching techniques. However, it is noted that the 

times in subscriber traffic and the relation of the peak times 15 network system does not accommodate data streaming for 

to specific requested content (step 152). For instance, sup- every video and audio file on the Internet, but instead only 

pose that a high number of subscribers frequently request the selected files. The system makes an intelligent choice as to 

CNN Web page during the morning hours of 6:30 AM to which content is likely to be requested by its subscribers and 

8:00 AM. These requests translate into a high number of then makes only this content readily available to the sub- 

URL hits for the CNN Web page which are recorded by hit 20 scribers. In this way, the method seeks to optimize the 

recorder 112 in the URL hit database 114. The pattern physical computing resources of the local service provider in 

recognizer 116 recognizes this recurring pattern of requests a manner which best services the majority of the clientele, 

for the CNN Web page and identifies the peak time for this It is noted that the content servers serve many local 

Web page to be between 6:30 AM and 8:00 AM. service providers over the Internet. These local service 

Using the patterns identified by the pattern recognizer 25 providers, in turn, serve many different users. Due to varying 

116, the scheduler 118 schedules delivery of the content at demographics, the local service providers will generally 

a selected time prior to the peak time (step 154 in FIG. 5). differ in the content that it most often serves to its clientele. 

In this example, the scheduler 118 might schedule delivery For example, a service provider in Seattle, Wash., might 

of the CNN Web page at a time prior to 6:30 AM For have many requests for content on entertainment or news 

instance, the scheduler 118 might schedule a request for the 30 local to Seattle. The pattern recognizer for a Seattle -based 

CNN Web page at 6:00 AM to provide sufficient time to service provider might therefore schedule proportionally 

download that page before the earliest subscribers are more Seattle related content than, say, a London -based 

expected to begin asking for it, yet not too early to ensure service provider. As a result, the sets of pre-cached content 

that the latest news is included. may differ significantly from one service provider to another 

At the scheduled time, a media loader 122 sends a request 35 depending upon the results of the local hit recorder and 

to the content server on the Internet and receives the content pattern recognizer. 

from that content server (step 156 in FIG. 5). The content is The system and method described above places the 

stored locally at the local service provider (step 158). More authority for deciding which content is pre-cached at the 

particularly, the data comprising the target resource is stored local service provider. This allows the local service provider 

as a proxy file in the cache memory 124, and any continuous 40 to adapt to the often changing patterns of its clientele, 

data content (e.g., audio or video data) is stored in the However, in another implementation, the content servers can 

continuous media server 126. In the Web context, the be given the governing authority of deciding when and what 

content might be in the form of a Web page or other content to download to the ISPs prior to peak times. For 

hypermedia document that has hyperlinks to various data instance, the content server can maintain a schedule of when 

items, such as audio and/or video clips. The hypermedia 45 to download different sets of content to various ISPs in 

document itself is stored in the cache memory 124, while the timely fashion before the sets of content are requested by the 

audio and vidco clips referenced by thc hyperlinks are stored respective users who arc serviced by thc ISPs, 

in the CMS 126. The target specifications corresponding to With continuing reference to FIG. 4, in this 

the links in thc cached hypermedia document are modified implementation, the local service provider 110 also includes 

to reference the audio and video files in the CMS 126, as 50 a policy manager 128 which defines and administers rules 

opposed to the files maintained at the Web site (step 160 in that determine which documents or resources are cached in 

FIG. 5). As an alternative to modifying the target the cache memory 124. For instance, caching rules might 

specifications, a conversion table can be constructed which call for caching resources that are routinely requested by 

converts requests from referencing the files at the Web site many subscribers, but foregoing caching resources that are 

to referencing the files in the CMS 126 . 55 rarely or infrequently requested. The policy rules also coor- 

In our CNN example, the local service provider 110 sends dinate cache maintenance by deciding when documents are 

a request to the CNN Web site seeking to download the CNN out-of-date and how these documents are deleted from the 

Web page at 6: 00 AM. Thc CNN Web page is downloaded cache memory 124. 

over the Internet and stored in the cache memory 124. If the According to another aspect of this invention, time-to-live 

CNN Web page contains links to any audio or vidco clips of 60 (TTL) tags are assigned to the content to assist in detcrmin- 

recent news, these data files are also downloaded and stored ing when the content should be refreshed or disposed. The 

in thc CMS 126. The links within the cached Web page are TTL tags can be assigned by the content server as part of the 

modified to reference the audio and video files stored locally content itself. The server can attach an expiration tag which 

in the CMS 126, instead of the files maintained at the CNN represents the publisher's best estimate as to how long it will 

Web site. 65 be before the content is updated. 

The media loader 122 loads the locally stored content just Alternatively, the local service providers might compute 

before the peak time so that it is ready to serve during the the TTL tags for the content it caches in cache memory 124. 
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The computation is based upon a theory that older content is 
less likely to change. Content that changed only 10 minutes 
ago is statistically more likely to change within the next 24 
hours than content that last changed one month ago. In one 
implementation, an approximate TTL is computed as a s 
percentage of time since the content is known to have last 
changed. The percentage is an operator controlled param- 
eter. Suppose a 10% value is selected. Content that last 
changed 72 hours ago is assigned a TTL tag of 7.2 hours. If 
the content is not updated within 7.2 hours, it is given a new 10 
value of 7.9 hours (i.e., (72 hours+7.2 hours)xl0%-7.9 
hours). As the content ages, it is checked less often. The Tm 
tags can be kept in a separate table of the cache 124 to 
correlate the tags and their content. 

Deletion policies are a function of the content itself (e.g., 15 
its TTL tags), the subscriber patterns (e.g., how frequently 
the content is requested), the cost to request newer updated 
content, and the constraints imposed by capacity limitations 
of the cache memory. 

The local service provider 110 also maintains a subscriber 20 
database 130 which stores lists of subscribers (or LAN users 
in the LAN configuration) and pertinent information about 
them (e.g., routing addresses, billing addresses, etc.). A 
usage reporter 132 uses the URL hit information from the 
URL hit database 114 and subscriber information from the 25 
subscriber database 130 to generate reports on subscriber 
usage patterns. These reports can be used by the operator to 
efficiently allocate computer resources to best satisfy the 
needs of its clientele. The reports can also be used by content 
providers to help them assess the popularity of their Web 30 
sites and the type of subscribers who visit them. 

In a preferred implementation, the functional components 
described with respect to FIG. 4 are implemented in soft- 
ware which executes on the host computer of the local 
service provider. It is noted that the functional layout is 35 
provided for explanation purposes. The subscriber database 
130, the URL hit database 114, and the cache memory 124 
can be implemented as one database server. Other imple- 
mentation variations may also be made. 

In the above system, the local service providers (e.g., 40 
ISPs, LAN Web servers) initiate the requests for content so 
that it may be pre-cached prior to peak demand times when 
the content is most likely to be requested. This system can 
be referred to as a "pull-caching" system in that content is 
pulled over the Internet upon request of the local service 45 
providers. The method of intelligently pull-caching data 
prior to peak times enables delivery of streaming video and 
audio data to Internet users. 

FIG. 6 shows a network system 200 according to another 
aspect of this invention. The network system 200 attacks the 50 
latency problem of streaming video and audio data by 
supplementing the primary Internet distribution network 
with a second network which is not reliant on the Internet/ 
ISP connection. The Internet/ISP connection is often the 
bottleneck for streaming data and is typically the connection 55 
least likely to be upgraded due to economic factors sur- 
rounding the business of the ISP. Although not required, in 
this implementation, the content may be pushed top down 
from the content provider over the Internet and thus, the 
system may be referred to as a "push-caching" system. 60 

Network system 200 is similar to the configuration of the 
FIG. 2 network system 50 in that it has a content server 52 
which serves content over a high-speed, high-bandwidth 
network 54, Ma local ISPs 56, to end users 58 and 60. The 
difference between the two systems is that network system 65 
200 of FIG. 6 has an additional, secondary network 202 for 
distributing content from the content server 52 to the ISPs 



56. In the illustrated implementation, the secondary network 
202 is a broadcast satellite network. The content provider 52 
has a transmitter 204 which a sends signals to an orbiting 
satellite 206, which redirects the signals to an ISP-based 
receiver 208. 

The secondary satellite network 202 affords a supplemen- 
tal bandwidth for delivery of content to participating ISPs in 
addition to the content delivered over the interactive net- 
work connection 62. For instance, using present DSS (digital 
satellite service) technology, the satellite network 202 pro- 
vides an additional 6 Mbps bandwidth to deliver content to 
the ISP 56. This extra bandwidth is made available at a 
fraction of the cost of buying T1/T3 connections. 

The supplemental-caching technique allows the content 
provider to download more information in a timely manner. 
To continue the above CNN example, the CNN content 
provider might transmit the CNN Web page over the satellite 
system 202 minutes before the peak time of 6:30 AM to 8:00 
AM. The ISP receives the Web page from its satellite 
receiver 208 and caches the Web page for serving during the 
peak time. The CNN Web page is thereby efficiently made 
available for real-time streaming to the subscribers, without 
tying up or consuming any of the bandwidth provided by the 
network connection 62. 

In a preferred implementation, the supplemental satellite 
network 202 is uni -directional, in that data is broadcast from 
the content provider 52 to the ISP 56. It is a low cost solution 
increasing the bandwidth of the pipeline to the ISP, without 
requiring significant investment on the part of the ISR In 
addition to satellite technologies, the broadcast network 202 
can be implemented as other wireless systems, such as RF 
or cellular technologies. 

In another embodiment, the secondary network 202 can 
be implemented as an second data communications network 
whereby supplemental content is multicasted to participat- 
ing ISPs prior to the peak time. 

In compliance with the patent statutes, the invention has 
been described in language more or less specific as to 
structure and method features. It is to be understood, 
however, that the invention is not limited to the specific 
features described, since the means herein disclosed com- 
prise exemplary forms of putting the invention into effect 
The invention is, therefore, claimed in any of its forms or 
modifications within the proper scope of the appended 
claims appropriately interpreted in accordance with the 
doctrine of equivalents and other applicable judicial doc- 
trines. 

What is claimed is: 

1. A method for operating a network system having a 
content provider which provides content through a local 
service provider to multiple content rendering units, the 
content being provided from the content provider to the local 
service provider over a first network, the method comprising 
the following steps: 

distributing supplemental content from the content pro- 
vider to the local service provider over a second 
network; 

assigning a time-to-live tag to the supplemental content to 
indicate when the content is expected to be updated; 
and 

storing selected portions of the supplemental content 
received from the content provider in a cache at the 
local service provider for use in serving the content 
rendering units. 

2. A method as recited in claim 1, and further comprising 
the step of estimating, at the local service provider, a 
time-to -live tag for the supplemental content received from 
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the content provider to indicate when the supplemental 
content is expected to be updated. 

3. A method as recited in claim 2, wherein the estimating 
step comprises deriving the time-to-live tag based upon a 
time since the supplemental content was last updated. s 

4. A system for providing content to user content render- 
ing units, comprising: 

a content provider having storage for storing the content; 

at least one local service provider to facilitate access to the 
content stored at the content provider on behalf of the 30 
content rendering units; 

an interactive network interconnecting the content pro- 
vider and the local service provider, 

a broadcast network; 35 

the content provider being configured to broadcast at least 
some of the content over the broadcast network to the 
local service provider, 
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the content provider being further configured to assign a 
time-to -live tag to the broadcasted content to indicate 
when the content is expected to be updated; and 

the local service provider being configured to cache the 
broadcasted content for serving to requesting content 
rendering units. 

5. A system as recited in claim 4, wherein the content 
provider assigns a time-to- live tag to the broadcasted content 
to indicate when the content is expected to be updated. 

6. A system as recited in claim 4, wherein the local service 
provider is configured to estimate a time-to-live tag for the 
broadcasted content to indicate when the broadcasted con- 
tent is expected to be updated. 

7. A system as recited in claim 4, wherein the local service 
provider is configured to estimate a time-to-live tag for the 
broadcasted content to indicate when the broadcasted con- 
tent is expected to be updated. 
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CERTIFICATE OF CORRECTION 



PATENT NO. : 6,324,182 Bl Page 1 of 2 

DATED : November 27, 2001 

INVENTOR(S) : Burns etal. 



It is certified that error appears in the above-identified patent and that said Letters Patent is 
hereby corrected as shown below: 



Column 1 , 

Line 24, change "&hat" to that --. 
Line 29, insert -- . -- after "Internet". 
Line 42, insert - , -- after "format". 

Column 2, 

Line 14, delete "a" after "of. 

Column 3 , 

Line 20, delete "is". 

Line 20, add - . -- after "data". 

Column 4 , 

Line 4, add -- . -- after "Internet". 
Column 5 , 

Line 26, replace "maimer" with ~ manner --. 
Column 6 , 

Line 3, add -- . - after "data". 

Line 19, add - . -- after "Internet". 

Line 21, replace "mission" with - transmission --. 

Column 8 , 

Line 21, add - . - after "document". 
Column 9 , 

Line 9, add -- . -- after "Internet". 
Line 29, add after "AM". 

Column 11, 

Line 12, replace "Tm" with -- TLL --. 
Line 64, replace "Ma" with via -. 
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